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Abstract 

Tight frames and rank-one quantum measurements are shown to be intimately related. In 
fact, the family of normalized tight frames for the space in which a quantum mechanical sys- 
tem lies is precisely the family of rank-one generalized quantum measurements (POVMs) on 
that space. Using this relationship, frame-theoretical analogues of various quantum-mechanical 
concepts and results are developed. 

The analogue of a least-squares quantum measurement is a tight frame that is closest in a 
least-squares sense to a given set of vectors. The least-squares tight frame is found for both 
the case in which the scaling of the frame is specified (constrained least-squares frame (CLSF)) 
and the case in which the scaling is free (unconstrained least-squares frame (ULSF)). The well- 
known canonical frame is shown to be proportional to the ULSF and to coincide with the CLSF 
with a certain scaling. 

Finally, the canonical frame vectors corresponding to a geometrically uniform vector set are 
shown to be geometrically uniform and to have the same symmetries as the original vector set. 
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1 Introduction 

Frames are generalizations of bases which lead to redundant signal expansions ^ . A frame for 
a Hilbert space W is a set of not necessarily linearly independent vectors that spans lA and has 
some additional properties. Frames were first introduced by Duffin and Schaeffer in the context 
of nonharmonic Fourier series, and play an important role in the theory of nonuniform sampling 
[0, ^. Recent interest in frames has been motivated in part by their utility in analyzing wavelet 
expansions [Q, ^. 

Many efforts have been made to construct bases with specified properties. Since the conditions 
on bases are quite stringent, in many applications it is hard to find "good" bases. The conditions 
on frame vectors are usually not as stringent, allowing for increased flexibility in their design |^, ^. 
For example, frame expansions admit signal representations that are localized in both time and 
frequency |||] , as well as sparse representations . 

Frame expansions have many other desirable properties. The coefficients may be computed 
with less precision than the coefficients in a basis expansion for a given desired reconstruction 
precision ; the effect of additive noise on the coefficients on the reconstructed signal is reduced in 



comparison with a basis expansion ^, ^, 10 1 ; and the coefficients are more robust to quantization 



degradations |11, 12 1. Recently, frames have been applied to the development of modern uniform 



and nonuniform sampling techniques |13], to various detection problems [0, 15], and to the analysis 



and design of packet-based communication systems [16|. 

A tight frame is a special case of a frame for which the reconstruction formula is particularly 
simple. As we show in Section ^ a tight frame expansion of a signal is reminiscent of an orthogonal 
basis expansion, even though the frame vectors in the expansion are linearly dependent. Tight 
frames are particularly popular, and will be the focus of this paper. 

Frame-like expansions have been developed and used in a wide range of disciplines. Many 
connections between frame theory and various signal processing techniques have been recently 



discovered and developed. For example, the theory of frames has been used to analyze and design 



oversampled filter banks [17, 18 1 and error correction codes [19|. Wavelet families have been used 
in quantum mechanics and many other areas of theoretical physics, particularly in the study of 
semiclassical approximations to quantum mechanics Q. 

In this paper we explore yet another connection between quantum mechanics and tight frames. 
Specifically, we show that the family of (normalized) tight frames for a subspace li in which a quan- 
tum mechanical system is known to lie is precisely the family of possible generalized measurements 
(POVMs) on U. Exploiting this equivalence, we can apply ideas and results derived in the context 
of quantum measurement to the theory of frames and vice versa. 

We begin in Section |3| by characterizing quantum measurements. With each rank-one quantum 
measurement we associate a measurement matrix. Using the measurement matrix representation, 
we give a simple and constructive proof of Neumark's theorem |20|, which relates general quan- 
tum measurements to orthogonal measurements. We then discuss the problem of constructing 
measurements optimized to distinguish between a set of non-orthogonal pure quantum states. 

We then follow a similar path in Section ^ for tight frames. We associate a frame matrix with 
every tight frame, which as we show has essentially the same properties as a quantum measurement 
matrix. Next, we derive an analogue of Neumark's theorem for tight frames, which expresses tight 
frame vectors as projections of a set of orthogonal vectors in a larger space. Finally, motivated 
by the construction of optimal quantum measurements, we consider the problem of constructing 
optimal tight frames for a subspace U from a given set of vectors that span li. 

The problem of frame design has received relatively little attention in the frame literature. 
Typically in applications the frame vectors are chosen, rather than optimized. Iterative algorithms 



for constructing frames that are optimal in some sense are given in |21|. Methods for generating 
frames starting from a given frame are described in Q. 

A popular frame construction from a given set of vectors is the canonical frame |8|, |l^, 22, p^ . 
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first proposed in the context of wavelets in |24]. The canonical frame is relatively simple to construct, 
can be determined directly from the given vectors, and plays an important role in wavelet theory 



26, 27 1 . However, no general optimality properties are known for the canonical frame. 



In Section ^ we systematically construct optimal frames from a given set of vectors. Motivated 
by the least-squares measurement [^8| derived for quantum detection, we seek a tight frame con- 
sisting of frame vectors that minimize the sum of the squared norms of the error vectors, where the 
ith error vector is defined as the difference between the ith given vector and the ith frame vector. 
We consider both the case in which the scaling of the frame is specified, and the case in which the 
scaling is such that the error is minimized. When the scaling is specified the optimizing frame is 
referred to as the constrained least-squares frame (CLSF), and when the scaling is not specified the 
optimizing frame is referred to as the unconstrained least-squares frame (ULSF). 

In Section |^ we show that the canonical frame vectors are proportional to the ULSF vectors, 
and that they coincide with the CLSF vectors with a specific choice of scaling. 

An important issue is to what extent frames constructed from a given set of vectors inherit the 



properties of the original vector set |22]. For example, it has been shown that when constructing 
normalized Gabor frames from windows satisfying certain decay conditions using the canonical 
frame construction, the resulting tight frame has similar decay properties In Section ^ we 

consider the case in which the original vectors have a strong symmetry property called geometric 
uniformity [^] . Based on results derived in the context of quantum detection [^] we show that the 
CLSF vectors and the ULSF vectors have the same symmetries as the original vectors. This implies 
that the canonical frame vectors associated with a geometrically uniform vector set are themselves 
geometrically uniform. 

Before proceeding to the detailed development, in Section ^ we first provide an overview of the 
notation and some mathematical preliminaries. 
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2 Preliminaries 



In this section we briefly review elements of linear algebra that are common to both signal processing 
and quantum mechanics. Our main goal is to characterize "transjectors" (partial isometries) using 
the singular value decomposition (SVD). 

2.1 Hilbert spaces and operators 

In both signal processing and quantum mechanics, the setting we consider is a finite-dimensional 
subspace W of a complex Hilbert space H. The elements of H are called vectors. We will often 
assume for notational convenience that H. is finite-dimensional, with dimT^ = k; then by appropriate 
choice of coordinates we can identify H with C*^. 

In signal processing, the elements of Ti are regarded as column vectors and denoted, e.g., by 
X gH. Then x* denotes the row vector which is the conjugate transpose of x. The inner product of 
two vectors is a complex number, denoted, e.g., by {x,y) = x*y. An outer product of two vectors 
such as xy* is a rank-one matrix, which as an operator takes z e H to xy*z = {y, z)x G H. 

The Dirac bra-ket notation of quantum mechanics expresses such concepts very nicely. We 
believe that the signal processing community would do well to master it; however, recognizing that 
it is unfamiliar, we do not rely on it in this paper. Nonetheless, to assist the reader unfamiliar with 
this notation in reading the quantum literature, we will give the bra-ket equivalents for various 
expressions in this section. 

In the bra-ket notation, the elements of TL are "ket" vectors, denoted, e.g., by \x) G Ti. The 
corresponding "bra" vector {x\ is an element of the dual space H* and may be regarded as the 
conjugate transpose of \x). The inner product of two vectors is a complex number denoted by 
{x\y). An outer product of two vectors such as \x){y\ is a rank-one matrix, which as an operator 
takes \z) eHto \x){y\\z) = {y\z)\x) G H. 

An operator on 7^ is a linear transformation A : H ^ H. The adjoint of an operator A is the 
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unique operator A* such that {x,Ay) = {A*x,y) for all x,y eH. If the elements of H are column 
vectors, then an operator A is represented by a square matrix, and its adjoint is represented by the 
conjugate transpose A*, since {x,Ay) = x* Ay = {A*x)*y = {A*x,y). 
An operator A is called Hermitian if it is self-adjoint; i.e., if A* = A. 

An orthogonal projector P is a Hermitian operator on H such that = P. Consequently, 
the eigenvalues of P all equal or 1. If {ui} is a set of eigenvectors corresponding to the nonzero 
eigenvalues of P, then the subspace U CH spanned by the set {ui} is the range of P, and we write 
the projector as Pu- A one-dimensional projector has a single eigenvector u and may be written as 
the outer product Pu = uu* (or P„ = \u){u\ in bra-ket notation); then P^ projects any x E H into 
the projection P^x = {u,x)u (or \u){u\x)). An r-dimensional projector Pk may be written as the 
sum of r one-dimensional projectors, Pu = X^jPuj, where {ui} is any basis for U. 

2.2 Transjectors (pEirtial isometries) 

Let P be a rank-r matrix whose columns are a set of n vectors cpi E H. Then F*x is a vector 
in C" whose components are the inner products {(pi,x). In other words, F* may be regarded as 
a linear transformation F* : TC —y C^. Similarly, F may be regarded as a linear transformation 

F ■.c'^n. 

It is well known in signal processing (but not as well known in quantum mechanics) that any 
such matrix F has an SVD P = WEV* , where [/ is a unitary matrix whose columns {ui G H} 
are the eigenvectors of the Hermitian operator T = FF*, F is a unitary matrix whose columns 
{vi G C"} are the eigenvectors of the Hermitian matrix S = F*F (the Gram matrix of inner 
products), and S is a positive real diagonal matrix whose r nonzero values aj, called the singular 
values of P, are the positive square roots of the nonzero eigenvalues of either S or T. Thus we may 
write P = (TiUiV* (or P = crj|uj)(wj|), a sum of r rank-1 outer products. 

An outer product such as Uivl (or |'Uj)(t!j|) is called a one-dimensional transjector. The trans- 
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jector UiV* takes a basis vector Vi G C" to the corresponding basis vector Ui G H. By linear 
superposition, it therefore takes a general element x = "^jivj, x)vj G C" to UiV*x = {vi,x)ui G H. 
Similarly, the adjoint transjector ViU* takes y = ^j{uj,y)uj to ViU*y = {ui,y)vi G C". 

The subspace spanned by the r eigenvectors Ui e H corresponding to the r nonzero eigenvalues 
oi S = F*F will be denoted as U C. H, and the subspace spanned by the r eigenvectors Vi G C" 
corresponding to the r nonzero eigenvalues of T = FF* will be denoted as V C C". The image of 
F is U, and the image of F* is V; the kernel of F is the orthogonal complement V"*" of V, and the 
kernel of F* is U~^. F operates by first performing an orthonormal expansion of C" using the basis 
{vi}, scaling each component by (jj, and then "transjecting" to W C K by replacing each Vi by the 
corresponding Ui. F* similarly "transjects" from H to V C C". 

A rank-r matrix F is called an r -dimensional transjector if its r nonzero singular values are all 
equal to 1. In other words, F = UZyV*, where U and V are unitary and 



n 



Ir 












Equivalently, FF* = U(ZrZ*)U* = Pjj is an r-dimensional orthogonal projector onto an r- 
dimensional subspace U ^ H with an orthonormal basis {ui G H, 1 < i < r} (the W-basis) 
consisting of the first r columns of U, and F*F = V{Z*Zr)V* = P\> is an r-dimensional orthogonal 
projector onto an r-dimensional subspace V C with an orthonormal basis {vi G C", 1 < i < r} 
(the V-basis) consisting of the first r columns of V. 

The SVD F = U Z^V* thus reduces to a sum of r one-dimensional transjectors (outer products): 

r 

F = Y,u,vX. (2) 
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Ji u eU, then u = Yll=i{ui, u)ui, and 



r 



F*u = 



(3) 



i=l 



i.e., F* "transjects" u to a corresponding vector v eV. Similarly, if u G V, then 



r 



Fv = '^{vi,v)ui; 



(4) 



i=l 



i.e., F performs the inverse map from V to W. If u G H, then F* first projects u onto U and then 
"transjects" to V as above; similarly, for a general v G C", F first projects v onto V and then 
"transjects" to U as above. 

An r-dimensional transjector F is also called a partial isometry, because it is an isometry 
(distance-preserving transformation) between the subspaces U C.H and V C C". Indeed, if v,v' &V 
and u = Fv, u' = Fv' , then 



so inner products and a fortiori squared norms and distances are preserved. Similarly, if u, u' G U, 
then {F*u,F*u') = {u,u'). However, inner products are not preserved if u,u' or v,v' ^ V. 
This discussion is summarized in the following theorem: 

Theorem 1 (Transjectors (partial isometries)). The following statements are equivalent for 
a matrix F whose columns are n vectors in a complex Hilbert space H: 

1. F is a transjector (partial isometry) between r-dimensional subspaces U CTC and V C C"; 

2. FF* = Pu for an r-dimensional subspace U C H; 

3. F*F = Py for an r-dimensional subspace V C C". 



{u,u') = u*u' = v*F*Fv' = v*Pvv' = v*v' = {v,v') 



(5) 
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A transjector F between r- dimensional subspaces U C TC and V C C" may be expressed as 
F = UZrV* , where U is a unitary matrix whose first r columns {ui, 1 < i < r} are an orthonormal 
basis forU, V is an nx n unitary matrix whose first r columns {vi, 1 < i < r} are an orthonormal 
basis for V, and Zr is given by ^). Equivalently, F = J2l=i ^i^i- 

A transjector F : C" U (resp. F* : 7i ^ V) is an isometry if restricted to V (resp. lA ). 



3 Quantum Measurement 



In this section we present some elements of the theory of quantum measurement, fohowing [28| and 
unpubhshed work in . In the remainder of the paper we will develop analogous results for tight 
frames. 

A quantum system in a pure state is characterized by a normalized vector (/> in a Hilbert space 
7i. Information about a quantum system is extracted by subjecting the system to a measurement. 
In quantum theory, the outcome of a measurement is inherently probabilistic, with the probabilities 
of the outcomes of any conceivable measurement determined by the state vector (p £ TC. 

A quantum measurement is described by a collection of Hermitian operators {Qi} on 7i, where 
the index i corresponds to a possible measurement outcome. The laws of quantum mechanics 
impose certain mathematical constraints on the measurement operators. 

In the simplest case, the measurement operators are rank-one operators and have the outer- 
product form Qi = Hill* for some nonzero vectors /ij G 7i. Such measurements will be called 
rank-one measurements, and the vectors /ij will be called the measurement vectors. 

If the state vector is 0, then the probability of observing the ith outcome is 

p{i) = {<P,Qi(t)) = \{liiA)?. (6) 
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To ensure that the probabihties p{i) sum to 1 for any normahzed (p € TC, we impose the constraint 

Y.Qi = lH, (7) 

i 

where I-j-i is the identity operator on Ti; then 

Y^p{i) = {4>,Y^Qi(t>) = {cP,<P) = l. (8) 

i i 

We distinguish between standard (von Neumann) measurements and generahzed measurements, 
or positive operator- valued measures (POVMs). In a standard measurement, the measurement 
operators {Qi} form a complete set of orthogonal projectors. Thus 



QiQi — Qi) (9) 

Q,Qj = 0, iii^j; (10) 
^Qi = Ih- (11) 

i 

If the measurement is rank-one, so that Qi = fiip.*, then ^ and ( p!o[ ) imply that ^j) = 6ij, 
while ([lT| ) implies that 

X = I-HX = '^{Hi,x)ni, Vx G (12) 

i 

SO the measurement vectors {/ij} form an orthonormal basis for 7i. 

Sometimes a generalized measurement is a more efficient way of obtaining information about 



the state of a quantum system than a standard measurement |2C]. A generalized measurement 
consists of a set {Qi} of nonnegative Hermitian operators, not necessarily projectors, that satisfy 
^iQi = Ih- Such a set of operators is termed a POVM. If the measurement is rank-one so that 
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Qi = HiH*, then the measurement vectors fii must satisfy 



^fiiH* = In- (13) 



A POVM is more general than a standard measurement in that the measurement vectors fii are 
not required to be either normahzed or orthogonal. 

It can be shown that a generalized measurement on a quantum system can be implemented by 
introducing an auxiliary system and performing standard measurements on the combined system. 



We will discuss this property in Section |3.2| in the context of Neumark's theorem; in Section 



we 



show that this property has an analogue for tight frames. 



3.1 Measurement Matrices 



A rank-one POVM acting on an r-dimensional subspace U '^TC in which the system to be measured 
is known a priori to lie is defined by a set of n measurement vectors {^i, 1 < i < n} that satisfy 

n 

^f^it^*=Pu, (14) 

i=l 

i.e., the n operators Qi = fiifi* must be a resolution of the identity on U^. 

The measurement matrix M corresponding to a set of measurement vectors fii €z U is defined 



as the matrix of columns fii [28|. We have immediately from ( [L4| ) that 

MM* = Pu- (15) 



Thus a matrix M with n columns in is a measurement matrix for states in the subspace U Q7i 
if and only if M satisfies (p^). 

^ Often these operators are supplemented by a projection Qo = — I-^ — Pjj onto the orthogonal subspace 

C 7i, so that YliT=o ~ — augmented POVM is a resolution of the identity on H.. 
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It follows immediately from Theorem |l| that a measurement matrix M with n columns in Ti 
corresponds to a rank-one POVM acting on an r-dimensional subspace U (^7i and only if M is 
a transjector (partial isometry) between U and an r-dimensional subspace V C C". Thus M has 
all the properties enumerated in Theorem 

A measurement matrix M represents a standard measurement if and only if its n columns are 
orthonormal; i.e., if and only if its Gram matrix satisfies M*M = In. Then M has rank n, U has 
dimension n, V = C", and M = U ZnV* for unitary U and V, where is given by 



Zn 



In 



(16) 



We summarize the properties of measurement matrices in the following theorem. 

Theorem 2 (Measurement matrices). The following statements are equivalent for a matrix M 
whose columns are n vectors in a complex Hilbert space 7i: 

1. M is a measurement matrix corresponding to a rank-one POVM acting on an r-dimensional 
subspace U C Ti; 

2. M is a transjector (partial isometry) between r-dimensional subspaces U CTi and V C C"; 

3. MM* = Pu for an r-dimensional subspace U '^TC; 
4- M*M = Py for an r-dimensional subspace V C C". 

A measurement matrix M corresponding to a rank- one POVM acting on an r-dimensional 
subspace U Q 7i may be expressed as M = UZ^-V*, where U is a unitary matrix whose first r 
columns {uj, 1 < z < r} are an orthonormal basis forU, V is an n x n unitary matrix whose first 
r columns {vi, 1 < i < r} are an orthonormal basis for V, and Z^ is given by Equivalently, 
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A measurement matrix M is an isometry if restricted to V. 

A measurement matrix M whose columns are n vectors in 7i represents a standard measurement 
if and only if its rank is n. Then M = U ZnV* , where Zn is given by ^Tdj), and M*M = 

3.2 Neumark's Theorem 



Neumark's theorem |2Cl|] guarantees that any POVM with measurement vectors ^ lA can be 
reahzed by a set of orthonormal vectors /xj in an extended space U such that U C U, so that 

Using the measurement matrix characterization of a POVM and the SVD, we now obtain 
a simple statement and proof of Neumark's theorem. Moreover, our proof is constructive; we 
exphcitly construct a set of orthogonal measurement vectors such that their projections onto lA are 



the original measurement vectors. In Section 4.2 we use this construction to extend a tight frame 
into an orthogonal basis for a larger space. 

Theorem 3 (Neumark's theorem). Let M he a rank-r measurement matrix of an arbitrary 
POVM, with n columns in a complex Hilbert space 7i. In other words, M is a transjector between 
an r -dimensional suhspace U 7i and an r -dimensional subspace V C C". Then there exists a 
standard ( von Neumann ) measurement with measurement matrix M which is a transjector between 
an expanded n-dimensional subspace U ^ U in a possibly expanded complex Hilbert space TC ^ 7i 
and C", and whose projection onto U is M = PuM . 

Proof Using Theorem ^ we may express M as M = UZrV*. Let Ui and Vi denote the columns of 
U and V respectively. Assume that TC is finite-dimensional, and let k = dim TC. 

We distinguish between the case k > n {i.e., M has at least as many rows as columns), and the 
case k < n (i.e., M has more columns than rows). 

In the case k > n, define M = J27=i ^i^j*! then U C 7i is the n-dimensional subspace spanned 
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by {ui, 1 < i < n}. The projection of M onto U is 



PuM = UjU* U^V* = Y = (17) 

j=l i=l 4=1 

Moreover, the columns of M are orthonormal, since its Gram matrix is 

n n n 

M*M = Y VjU* J2 = Yl = (1^) 

j=l i=l 1=1 

In the case k < n, first embed U in an n-dimensional space U in an expanded complex Hilbert 
space TC ^ TC, and let {{tj, 1 < i < n} be an orthonormal basis for U of which the first m vectors 
are the Z^-basis. Then proceed as before, using Ui in place of Uj. □ 

It is instructive to consider the matrix representation of M in both cases. Recall that M = 
UZrV*, where Zr is given by (|l|). 

In the case k > n, we construct M simply by extending the identity matrix along the diagonal; 
thus M = UZnV* where Z„ is given by (p!6|). Thus, when k > n, the left and right unitary matrices 
in the SVD of M and M are the same, and are equal to U and V, respectively. 

If k = n, then Zn = In and M = UV* . 

In the case k < n, we first replace the left unitary matrix U by U, and thus replace A; by A: = n; 
then [/ is an n X n unitary matrix whose first r columns are the U-hasis (where we append n — k 
zeros to each basis vector m). We then define M = UV*. 

Examples of the construction of the orthogonal measurement vectors associated with a given 



POVM along the lines of this proof will be given in Section 4.2, in the context of frames. 
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3.3 Optimal Quantum Measurements 

We now recapitulate some results on optimal quantum measurements according to various criteria, 
which will be relevant to the construction of optimal tight frames. 

Let {(pi, 1 < i < n} be a collection n < k normalized vectors in a /c-dimensional com- 
plex Hilbert space Ti., representing different preparations of a quantum system. In general these 
vectors are non-orthogonal and span an r-dimensional subspace U '^Ti. The vectors are linearly 
independent if r = n. 

To distinguish between the different preparations, we subject the system to a measurement. 
For our measurement, we restrict our attention to POVMs consisting of n rank-one operators of 
the form Qi = ^Uj/i* with measurement vectors fii G U. We do not require the vectors /Xj to be 
orthogonal or normalized. However, to constitute a POVM on lA the measurement vectors must 
satisfy 

If the states are prepared with equal prior probabilities, then the probability of detection error 
using the measurement vectors fii is given from ^ by 

1 " 

^'e = l--J^|(M.,</'^)|'. (19) 
i=l 

If the vectors /Uj are orthonormal, then choosing fii = <j)i results in = 0. However, if the 
given vectors are not orthonormal, then no measurement can distinguish perfectly between them. 
Therefore, a fundamental problem in quantum mechanics is to construct measurements optimized 
to distinguish between a set of non-orthogonal pure quantum states. 

This problem may be formulated as a quantum detection problem, so that the measurement 
vectors are chosen to minimize the probability of detection error, or more generally, minimize the 
Bayes cost. Necessary and sufficient conditions for an optimum measurement minimizing the Bayes 
cost have been derived p^, 33 1. However, except in some particular cases |33, 34, IsH], obtaining 
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a closed-form analytical expression for the optimal measurement directly from these conditions is 
a difficult and unsolved problem. 



An alternative approach proposed in |25] is to choose a different optimality criterion, namely a 
squared-error criterion, and to seek measurement vectors that minimize this criterion. Specifically, 
the measurement vectors are chosen to minimize the sum of the squared norms of the error vectors, 
where the ith. error vector is defined as the difference between the ith state vector and the ith 
measurement vector. The optimizing measurement is referred to as the least-squares measurement 
(LSM). 

It turns out that the LSM problem has a simple closed-form solution which has many desirable 
properties. Its construction is relatively simple; it can be determined directly from the given 
collection of states; it minimizes the probability of detection error when the states exhibit certain 



symmetries |28]; it is "pretty good" when the states to be distinguished are equally likely and 



almost orthogonal |36]; and it is asymptotically optimal |37]. 

In the next section we will develop a relationship between POVMs and tight frames. We 
then apply ideas and results derived in the context of quantum detection to the construction and 
characterization of tight frames. In particular, we will apply the squared-error criterion developed 
in 1 28] to the construction of optimal tight frames. 
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4 Tight Frames 



Frames, which are generahzation of bases, were introduced in the context of nonharmonic Fourier 
series by Duffin and Schaeffer Q (see also [^). Recently, the theory of frames has been expanded 
[Q, ^, |8|, ^, in part due to the utility of frames in analyzing wavelet decompositions. Here we will 
focus on tight frames, which have particularly nice properties. 

Let {ipi, 1 < i < n} denote a set of n vectors in an r-dimensional subspace U of a Hilbert space 
Ti.. The vectors tpi form a tight frame for U if there exists a constant /? > such that 

n 

Y,\{x,^i)\' = l3'\\x\\\ (20) 

i=l 

for all X € If /3 = 1, the tight frame is said to be normalized; otherwise it is said to be 

/3-scaled^ 

Of course any orthonormal basis for U is a normalized tight frame for U. However, there also 
exist tight frames for U with n > r, which are necessarily linearly dependent. The redundancy of 
the tight frame is defined as p = n/r. 

Since 

n n / \ 

^\{x,ipi)\'^ = ^x*ipiip*x = {x, i^ifiif* x), (21) 

i=l i=l \ i / 



^More generally, the vectors (ft form a frame for U if there exist constants a > and (3 < oo such that 

n 
i = l 

for all X £ U 1^]. The lower bound ensures that the vectors (pi span ht; thus we must have n > r. If n < oo, then 
the right hand inequality is always satisfied with = ^ZT^i Thus, any finite set of vectors that spans U is a 

frame for In particular, any basis for W is a frame for W. However, in contrast to basis vectors, which are linearly 
independent, frame vectors with n > r are linearly dependent. A tight frame is a special case of a frame for which 
a = /3. 
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the fact that ( pO[ ) holds for ail x imphes that 

n 

Y,^,^*=(3^Pu. (22) 



1=1 



Conversely, if the vectors £ U satisfy (|2^), then (p]) implies that (pO|) is satisfied for all x £ U. 
We conclude that a set of n vectors ipi £ U forms a tight frame for li if and only if the vectors 



satisfy (22) for some /3 > 0. 



Comparing ( p^ ) with (14), we conclude that: 



Theorem 4 (Tight frames). A set of vectors ipi £ U forms a f3-scaled tight frame for U if and 
only if the scaled vectors (3~^(pi are the measurement vectors of a rank-one POVM on U. In 
particular, the vectors ipi form a normalized tight frame for U if and only if they are the measurement 
vectors of a rank- one POVM on lA. 

This fundamental relationship between rank-one quantum measurements and tight frames will 
be the basis for the developments in subsequent sections. In the next section, we define frame 
matrices in analogy to the measurement matrices of quantum mechanics. We then use Neumark's 
theorem to extend tight frames to orthogonal bases. Motivated by the least-squares measurement 
of quantum mechanics, in Section ^ we address the problem of constructing optimal tight frames. 



18 



4.1 Frame Matrices 

In analogy to the measurement matrix, we define the frame matrix F as the matrix of columns ipi , 
where the vectors (pi form a tight frame for U. From (|2^ ) it then follows that 

FF* = (23) 

The properties of a frame matrix F follow immediately from Theorem Q and Theorem ^: 

Theorem 5 (Frame matrices) . For a matrix F whose columns are n vectors in a complex Hilbert 
space Ti and for a constant /3 > 0, the following statements are equivalent: 

1. F is the frame matrix of a f3-scaled tight frame for an r-dimensional subspace U C Ti; 

2. f3~^F is a trans jector (partial isometry) between r-dimensional subspaces lA ^Ti and V Q C^; 

3. FF* = P'^Pu for an r-dimensional subspace U CH; 
4- F*F = P'^P-\; for an r-dimensional subspace V C C". 

A frame matrix F of a (3-scaled tight frame for an r-dimensional subspace U TC may be 
expressed as F = (3UZrV* , where U is a unitary matrix whose first r columns {n,,, 1 < i < r} are 
an orthonormal basis forlA, V is an nx n unitary matrix whose first r columns {fj, 1 < i < r} are 
an orthonormal basis for V, and is given by Equivalently, F = /? "UiV*. 

A frame matrix F of a (3-scaled tight frame is an isometry if restricted to V and scaled by (3'^ . 

A frame matrix F of a (3-scaled tight frame whose columns are n vectors in TC represents an 
orthogonal basis for U (i.e., is an orthogonal frame matrix^ if and only if its rank is n. Then 
F = (3U ZnV* , where Zn is given by ^idj), and F*F = (3'^In; i-e., all frame vectors have squared 
norm (3"^. 

If the vectors 1 < i < n} form a tight frame for U, then any x £ U may be expressed as a 
linear combination of these vectors: x = Y^iCiipi. When n > r, the coefficients in this expansion 
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are not unique. A possible choice is = /3 '^{ipi,x), because 

n 

^-2^((/.i,x)99i =/3-2fF*x = Pwx = x. (24) 

i=l 

This choice of coefficients has the property that among all possible coefficients it has the minimal 



norm 



The expansion of (|2J) is reminiscent of an expansion of x in terms of an orthonormal basis for 
U. However, whereas the vectors in an orthonormal expansion are linearly independent, the vectors 
(pi in (p4|) are linearly dependent when n > r. 

4.2 Neumark's Theorem and Construction of Tight Frames 

Neumark's theorem (Theorem ^ was derived based on the properties of measurement matrices. 
Since by Theorem ^ frame matrices of tight frames have essentially the same properties as mea- 
surement matrices of rank-one POVMs, we can now obtain an equivalent of Neumark's theorem 
for tight frames. The proof is essentially the same as the proof of Theorem |3|, so we omit it. 

Theorem 6 (Neumark's theorem for tight frames). Let F be a rank-r frame matrix, with n 
columns in a complex Hilbert space 7i that span an r-dimensional subspace U Q 7i. Then there 
exists an orthogonal frame matrix F with equal-norm orthogonal columns that span an expanded 
n-dimensional subspace U ^ U in a possibly expanded complex Hilbert space 7i ^ TC such that the 
projection PuF of F onto U is F. 

We remark that given a set of equal-norm orthogonal vectors inU ^U, their projections onto 
U will always form a tight frame for U Q . Combining this result with Theorem ^ we can conclude 
that a set of vectors forms a tight frame for U if and only if the vectors can be expressed as a 
projection onto U a set of orthogonal vectors with equal norm in a larger space U containing U. 

Starting with a given frame matrix F inU, the proof of Theorem H gives a concrete construction 
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of an orthogonal frame matrix F inU such that PuF = F. We now give two examples of this 
construction. We consider first an example in which dim TC < n, and then one in which dim TC > n. 
Example 1. Consider the four frame vectors (pi = [0.35 — 0.61]*, (^2 = [0.61 0.35]*, 993 = 
[0.5 — 0.5]*, and 994 = [0.5 0.5]*. The frame matrix associated with this frame is 



0.35 0.61 0.5 0.5 
-0.61 0.35 -0.5 0.5 



(25) 



we may check that F is indeed the frame matrix of a tight frame since FF* = I2. 

We wish to construct an orthogonal frame matrix F such that F = PuF. In the proof of 
Theorem ^ for the case dim 7i < n, we constructed an n x n unitary matrix F using the SVD 
F = UT,V* . Using this construction here, we obtain: 



U 



0.5 -0.87 
-0.87 -0.5 



10 
10 



0.70 





0.70 








-0.70 





-0.70 


0.68 


-0.18 


-0.68 


0.18 


-0.18 


-0.68 


0.18 


0.68 



• (26) 



We now define the extended frame matrix U in accordance with the proof of Theorem ^ The first 
two columns of U are uniquely defined as the first two columns of U with zeroes appended. The 
remaining two columns are arbitrary, as long as the resulting U is unitary. A possible choice is: 



U 



0.5 
-0.87 





-0.87 
-0.5 







0.5 
-0.87 






-0.87 
-0.5 



(27) 
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Then 



F = UV* = 



0.35 


0.61 


0.5 


0.5 


-0.61 


0.35 


-0.5 


0.5 


0.35 


0.61 


-0.5 


-0.5 


-0.61 


0.35 


0.5 


-0.5 



(28) 



We may immediately verify that F*F = I4; i.e., F represents an orthonormal set of vectors. 

Since the columns of F span a 2-dimensional Hilbert space U = H, the projection onto this 
space is given by 



Pu 



10 

10 







(29) 



and indeed F = PuF. 

Example 2. We now consider an example in which dim H > n. The construction of F is simpler 
than in the previous case because we do not have to extend H. Consider the three frame vectors 
(pi = i[l 1 1]*, ip2 = ^[-1 1 1]*, and ip3 = ^[\/2 0]*. The frame matrix associated with this 
frame is 



2 



1 -1 ^/2 

110- (30) 
1 1 

In order to verify that F is indeed the frame matrix of a tight frame, we again determine the 
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SVD F = UT.V*, which yields 





0.58 


0.82 







1 










0.87 





0.5 


u = 


0.58 


-0.4 


0.7 


, s = 





1 





, v = 


0.29 


-0.82 


-0.5 




0.58 


-0.4 


-0.7 















0.4 


0.58 


-0.7 



(31) 



From Theorem |5| we conclude that F is indeed the frame matrix of a tight frame since its nonzero 
singular values are all equal to 1; i.e., F is a, transjector. A basis for the subspace li spanned by 
the columns of F is the two vectors 



0.58 0.58 0.58 



, U2 



0.82 -0.4 -0.4 



(32) 



Thus, Py is given by 



-PW = ^ UiUi 
i=l 



10 
0.5 0.5 
0.5 0.5 



(33) 



and indeed FF* = Pu . 

We now define an extended frame matrix F such that F = PuF and F*F = I3. From the proof 
of Theorem |3|, we have 



F = UZ3V* = UV* = F + 



0.5 -0.5 0.7 
0.85 0.15 -0.5 
0.15 0.85 0.5 



where 



-"3 



0.7 -0.7 



V3 



0.5 -0.5 0.7 



(34) 



(35) 



Since Puu^v^ = 0, we have immediately that F = PuF. 
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5 Optimal Tight Frames 

It is often of interest to construct a tight frame from a given set of vectors {(pi, 1 < i < n}. Different 
constructions have been proposed in the hterature |6|, however, in the general case no opti- 
mahty properties are known for these different constructions. Using the least-squares measurement 



(LSM) developed in the context of quantum detection [28|, we now propose a systematic method 
of constructing optimal tight frames from a given set of vectors. 

Thus we seek to construct a tight frame of vectors {(/'i, 1 < i < n} from a given set of vectors 
{4>i, 1 < i < n} that span an r-dimensional space U '^TC. A reasonable approach is to find a set of 
vectors (pi £ U that are "closest" to the vectors (pi in the least-squares sense. Thus we seek vectors 
ipi that minimize the squared error E, defined by 



E = Y,{ei,e^), (36) 



i=l 

where denotes the ith error vector 

ei = (j)i- (fi, (37) 

subject to the constraint 



We may wish to constrain the scaling /? in (22), e.g., we may seek a normalized tight frame 



with P = 1. The optimal frame in this case is derived in Section p.l| and is referred to as the 
constrained least-squares frame (CLSF). Alternatively, we may choose the vectors {fi} and /? to 
satisfy ( p2|) and to minimize the squared error E of (|3^). The optimal frame is then referred to as 



the unconstrained least-squares frame (ULSF), and is derived in Section 5.2. 
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5.1 Constrained least-squares frame 

We first consider the problem of constructing a set of vectors {(pi,l < i < n} that minimize E of 
(|36|) , subject to the constraint 

n 

Y,^,^*=l3lPu, (38) 
1=1 

where /3g is specified. In the case of a normahzed tight frame (5^ = 1. 

If the vectors (/>i are mutually orthogonal with cpi) = /3q, then the solution to (36) satisfying 
the constraint (p8|) is simply ipi = (pi, 1 < i < n, which yields E = 0. 



To derive the solution in the general case, denote by F and ^ the kxn matrices whose columns 
are the vectors cpi and (pi, respectively. The squared error E of (l36|)-(|37|) may then be expressed in 
terms of these matrices as 

E = Tr{{^- F)*($ - F)) = Tr (($ - F)($ - F)*) . (39) 



The constraint (22) may then be restated as 

FF* = PiPu. (40) 



The least-squares problem of (39) seeks a frame matrix F that is "close" to the matrix If the 
two matrices are close, then we expect that the underlying linear transformations they represent 
will share similar properties. The SVD of $ specifies orthonormal bases for V and U such that the 
linear transformations ^ and <I>* map one basis to the other with appropriate scale factors. Thus, 
to find an F close to ^> we need to find a linear transformation F that performs a map similar to 



Employing the SVD ^ = UTjV*, we rewrite the squared error E of (|39D as 



E = Trii<^-F){<^-Fy)=Tr{U*{<^>-F){<^>-FyU) = Y,{d^,d^), (41) 

i=l 
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where 



(42) 



The vectors {ui, 1 < i < r} form an orthonormal basis for U. Therefore, the projection operator 
onto U is given by 

r 



i=l 



EssentiaUy, we want to construct a map F* such that the images of the maps defined by and 
F* are as close as possible in the squared norm sense, subject to the constraint 



FF* = PiYl 



(44) 



4 = 1 



The SVD of is given by = VJ:*U*. Consequently, 



(TiVi, 1 <i <r; 
0, r + I < i < k, 



(45) 



where denotes the zero vector. Denoting the image of Ui under F* by = F*Ui, for any choice 
of F satisfying the constraint (44) we have 



{ai,ai) = u*FF*Ui = < 



Pi l<i<r; 
0, r + I < i < k, 



(46) 



and 



(aj, Qj) = u*FF*Uj = 0, i / j. 



(47) 



Thus the vectors Oj, 1 < i < r, are mutually orthogonal with {ai,ai) = (3q and = 0, m + 1 < 
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i < k. Combining (H^) and we may express di as 



aiVi -tti, I <i <r; 
0, r + l<i<k. 



(48) 



Our problem therefore reduces to finding a set of r ortliogonal vectors with norm Pq that 
minimize E = X]i=i(^«' '^«)' where di = CTiVi — Ui. Since the vectors Vi are orthonormal, the 
minimizing vectors must be Oj = PoVi, 1 < i < r. 

Thus the optimal frame matrix F, denoted by Fc, satisfies 



F^Ui = ( 



PoVi, l<i<r; 
0, r + I < i < k. 



Consequently 



1=1 



We may express Fc in matrix form as 



(49) 



(50) 



Fc = PoUZrV* 



(51) 



where Zr is defined by (||). The residual squared error is then 



i=l 



1=1 



(52) 



Note that if the singular values cij are distinct, then the vectors Uj, 1 < i < r are unique (up to 
a phase factor e-'^*). Given the vectors Ui, the vectors Vi are uniquely determined, so the optimal 
frame vectors corresponding to Fc are unique. If, on the other hand, there are repeated singular 
values, then the corresponding eigenvectors are not unique. Nonetheless, the choice of singular 
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vectors does not affect Fc- Indeed, if the vectors corresponding to a repeated singular values are 
{uj}, then is a projection onto the corresponding eigenspace, and therefore is the same 

regardless of the choice of the vectors {uj}. Thus 

^ up* = - ^ UjU*^, (53) 

3 i 

independent of the choice of {uj}, and the optimal frame is unique. 
We may express Fc directly in terms of $ as 

Fe = /3o^((^*'^>)'/')^ (54) 

where (•)t denotes the Moore-Penrose pseudo-inverse [Hf. Indeed, ((<I)*<I))V2)t = V{{'iTT,Y/'^)W* , 
where ((S*S)^/^)^ is a diagonal matrix with diagonal elements l/oi for 1 <i <r and otherwise, 
so that $(($*$)i/2)t = UZrV*. 

Alternatively, Fc may be expressed as 

Fc = m^^*f'^)^^, (55) 

where (($#*)V2)t = [/((SS*) V2)t[/*. 

We note that the optimal frame vectors (pi satisfy 

{^lct>,) = [F*c^],, = l3o[^*^]]i\ (56) 

where denotes the iilh. element of the matrix. This relation may be used to derive bounds on 
the inner products in terms of the inner products {(pijCpj); see [p7|]. 
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5.1.1 Optimal orthogonal basis and the CLSF 

In the previous section, we sought the /3o-scaled tight frame that minimizes the least-squares error. 
We may similarly seek the optimal orthogonal vectors with norm Pq of the same form. We now 
explore the connection between the resulting optimal vectors both in the case of linearly independent 
vectors (pi {r = n), and in the case of linearly dependent vectors (r < n). 

Linearly independent vectors: If the vectors (pi are linearly independent and consequently $ has 
full column rank {i.e., r = n), then ( |54[ ) reduces to 



The optimal frame vectors (^^ are mutually orthogonal with equal norm Pq, since their Gram matrix 
is 



Thus, the optimal frame is in fact an optimal orthogonal basis for U. 

Linearly dependent vectors: If the vectors (pi are linearly dependent, so that the matrix ^ does 
not have full column rank (i.e., r < n), then the n frame vectors (fi cannot be mutually orthogonal 
since they span an r-dimensional subspace. We now try to gain some insight into the optimal frame 
vectors in this case. Our problem is to find a set of vectors that are as close as possible to the 
n vectors (pi, which lie in an r-dimensional subspace U. We now show that these vectors are the 
projections onto U of the set of norm-/?o orthogonal vectors in 7i that are closest to the vectors (pi. 

To see this, suppose we seek a set of orthogonal vectors tpi ^ TL with {(fi, (pi) = Pq that are as 
close as possible to the vectors (pi. From Theorem ^ we have that 




(57) 




(58) 



71 




(59) 



i=l 
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where U is the space spanned by the vectors tpi . 

Since there are at most r orthogonal vectors in U, imposing an orthogonahty constraint forces 
the optimal orthogonal vectors (fi to lie partly in the orthogonal complement U'^ . Each vector then 
has a component inU, (p^ , and a component in U^, <f>^^ ■ Using (p^), the component in U satisfies 



E '^^('^^)* = E Pv<^^^*Pu = PlPuPjjPu = PiPu, (60) 

i=l i=l 

where the last equality follows from the fact that U 'ZU. Now we rewrite the error E of (|3^) as 



i=l 

n 

= 5]((0.-^^,<^.-^^) + (c^r,^f )), (61) 



since {(fii — (p^ ^ (p^ ) = 0- From (pO|) we have that 



1=1 i=l i=l 



\i=l 



= nl3l-T,{PlPu) = {n-r)Pl, (62) 
independent of the choice of vectors (pi. Thus, minimization of E is equivalent to minimization of 

n 

E' = Y,{cl>,-(p^,,<^,-(p^,). (63) 



1=1 



Furthermore, from (30) the vectors form a /3o-scaled tight frame for U. 

We conclude that choosing a set of orthogonal vectors with equal norm /5o that minimize E 
is equivalent to choosing an optimal /3o-scaled tight frame for lA. The optimal orthogonal vectors 
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are not unique; however, their projections onto U are unique and are just the optimal /3o-scaled 
tight frame vectors. We may choose the projections of the optimal orthogonal vectors onto lA^ 
arbitrarily, as long as the resulting n vectors are orthogonal with norm /?o- A convenient choice is 



n 



Fc = Po^UiVi. (64) 



i=l 



Indeed, Theorem ^ shows that the optimal orthogonal vectors are just a realization of the 
optimal frame vectors. This theorem guarantees that any /3o-scaled tight frame may be realized by 
a set of orthogonal vectors with norm /?o in an extended space such that their projections onto the 
smaller space are the given frame vectors. Denoting by and (f^ the optimal frame vectors and 



orthogonal vectors, respectively, (|63|) asserts that 

= Pu^l (65) 

We summarize our results regarding the CLSF in the following theorem: 

Theorem 7 (Constrained least-squares frame (CLSF)). Let {(pi} be a set of n vectors in a 
k- dimensional complex Hilbert space TC that span an r- dimensional suhspace U '^Ti. Let {(pi} denote 



the optimal n frame vectors that minimize the least-squares error defined by (36)- (SI), subject to 



the constraint (38). Let <I> = UHV* be the rank-r k x n matrix whose columns are the vectors 4>i, 
and let Fcbe the k x n frame matrix whose columns are the vectors tp'^. Then the unique optimal 
Fc is given by 

r 

Fc = PoY^u^v* = PoUZrV* = /?of ((f *a>)i/2)t = /3o(($$*)i/2)tci>, 

i=l 

where Ui and Vi denote the columns of U and V respectively and is defined by (01). 
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The residual squared error is given by 



2 



1=1 



where {cj, 1 < i < r} are the nonzero singular values of^. 
In addition, 

1. If r = n, 

(a) Fe = f5oH^*^)-^/^; 

(h) F*Fc = PqIu, and the corresponding frame vectors are orthogonal with norm Po- 

2. Ifr< n, 

(a) Fc may be realized by the f3o-scaled optimal orthogonal frame matrix Fc = f3Q Y17=i '^i'^t — 

(b) the action of the two optimal vector sets in the subspace U is the same. 
5.2 Unconstrained least-squares frame 

We now consider the least-squares problem where the scaling of the frame is not constrained. Thus, 
we seek a set of vectors {fi} that minimize the squared error E of (|36[), subject to 



^^,^*=FF* = (3''Pu, (66) 

where F is the matrix of columns ipi, and f3 > 0. 

The derivation of the solution to this minimization problem is very similar to the derivation of 
the CLSF of Section Following the same steps, we can express E as 

k 

E = Y,{dudi), (67) 

i=l 
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where di = OiVi — ai. 

For any choice of F satisfying the constraint (^) we have 



{ai.ai) = u*FF*Ui = < 



1 < i < r; 

(68) 

0, r + l<i<k, 



and 

{ai,a,) = u*FF*u, = 0,i^j. (69) 

Thus the vectors Cj, 1 < i < r, are mutually orthogonal with (cj, Cj) = and aj = 0, r+1 < i < k. 

Our problem therefore reduces to finding a set of r orthogonal vectors aj with equal norm /? 
that minimize (p7|). Expressing £' as 



i=l 

where 3f?{-} denotes the real part, we see that minimization of E is equivalent to minimization of 

r r 

E' = r/32 -2Y,'yM{ai,v^)] = r(i^ - 2/? ^ ^7,3?{(a„ i;^)}, (71) 



i=l 1=1 



where Oj = Oj//?. To determine the optimal vectors we have to minimize E' with respect to P 
and di. Fixing di and minimizing with respect to /?, the optimal value of /? is given by 

- 1 

P= - Y,^i^{i^^^Vi)}. (72) 

i=l 

Substituting /? back into ([7l|) , we get that the vectors are chosen to maximize 
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subject to the constraint 



{ai,aj) = 6ij. 



(74) 



Since the vectors Vi are orthonormal, the minimizing vectors must he di = Vi, 1 < i < r. Substi- 
tuting into (^), 

^=-^a,5R{(t;„i;,)} = -^a, = a, (75) 



1=1 



1=1 



and a, = avi. 

Thus the optimal frame matrix F, denoted by F^, satisfies 



F>^ = < 



aVi, 1 < i < r; 
0, r + l<i<k. 



(76) 



Consequently 



Fu = a'^ 



UiV. 



i=l 



We may express F^ in matrix form as 



(77) 



K = aUZrV* = a$(($*$)^/2)t = a(($$*)i/2)t$^ 



(78) 



where Zr is defined by (|l|). The residual squared error is then 



r r 
^mm = ~ cri)'^{vi,Vi) = y^(a - di)^ 



i=l 



1=1 



(79) 



Recall that 5 = $*^> = thus Tr(5) = ^[^^ cif . Therefore 



(80) 



Note that as we expect E^^^^ < E^^^, where E^^^ and E^^^ are given by (^) and (p^) respectively. 
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with equality if and only if /3o = a- 

5.2.1 Optimal orthogonal basis and the ULSF 

We now explore the connection between the least-squares orthogonal vectors with unconstrained 
norm and the ULSF. 

Linearly independent vectors: If the vectors are linearly independent and consequently $ has 
full column rank [i.e., r = n), then ( [78[ ) reduces to 

The optimal frame vectors (p^ are mutually orthogonal with equal norm a^, 

F*Fc = a2($*$)-i/2$*$($*$)-i/2 ^ ^2j^ (82) 

and the optimal frame vectors are in fact the optimal orthogonal vectors. 

Linearly dependent vectors: If the vectors 4>i are linearly dependent, so that the matrix $ does 
not have full column rank (i.e., r < n), then the n frame vectors cpi cannot be mutually orthogonal 
since they span an r-dimensional subspace. In analogy to the constrained case we now show that 
the optimal orthogonal vectors are related to the optimal frame vectors through a projection onto 
the subspace U, spanned by the vectors 

Suppose we seek a set of orthogonal vectors (fi G Ti with equal norm that are as close as possible 
to the vectors (pi. From Theorem ^ we have that 

n 

Y,m = fi''Pw (83) 

i=X 

for some /? > 0, where U D U is the space spanned by the vectors ipi. Now, each vector ipi has a 
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component in U, (p^ , and a component in U-^ , 0{ . Using (p3|), the component in U satisfies 

n n 

(^f )* = Y,Pu^^^*Pu = P'PuPaPu = P^Pu. (84) 

i=l 

From (|83) we have that 



1=1 



i=l 1=1 i=l 



- Tr{(3'^Pu) = {n- r)/?^. (85) 



Rewriting the error E of ( |36D as in (SI) and using (pq), we conclude that minimization of E is 
equivalent to minimization of 



E' = Y,{4>^ - , <t>^ - 0i) + - r), (86) 

where from (84) the vectors 0^ form a /3-scaled tight frame for lA. 

Following the derivation of the ULSF, minimizing E' is equivalent to minimizing 

r r 

E" = r(3^ -2PJ2 ^^^{{a^,v^)} + (n - r)p^ = n/?^ - 2/? ^ a,3f?{(ai, (87) 

1=1 i=l 

where Oj = aj//3, a, = {F^)*Ui, and is the matrix of columns (f^ . Fixing di and minimizing 
with respect to P, the optimal value of P, denoted by /3, is given by 



~ 1 



n 

i=l 



Substituting /3 back into (|87|), we get that the vectors di are chosen to maximize (|73|) subject to 
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([74D. Thus, the minimizing vectors are di = Vi, 1 < i < r. Substituting into (88) we have that 



~ 1 ^ 1 r a 

13 = - S^ ai'R{{vi,Vi)} = - V'o-j = -a = -, 

i=l i=l ^ 

where a is defined in ( [75D and p is the redundancy of the frame. Thus the optimal projections are 
the columns of {l/p)Fu, where Fu is the frame matrix of the ULSF vectors. 

We conclude that choosing a set of orthogonal vectors with unconstrained norm that minimize 
E is equivalent to choosing an optimal unconstrained tight frame for U and scaling these optimal 
frame vectors by 1/p. The optimal unconstrained orthogonal vectors are not unique; however, 
their projections onto U are unique and are proportional to the optimal unconstrained tight frame 
vectors. We may choose the projections of the optimal orthogonal vectors onto arbitrarily, as 
long as the resulting n vectors are orthogonal with norm a/ p. A convenient choice is 

Fu = -y,Uivl (90) 

We summarize our results regarding the ULSF in the following theorem: 

Theorem 8 (Unconstrained least-squares frame (ULSF)). Let {0,;} be a set of n vectors in 
a k- dimensional complex Hilhert space 7i that span an r -dimensional subspace U Q 7i. Let {(^"} 
denote the optimal n frame vectors that minimize the least-squares error defined by 



subject to the constraint (66). Let $ = UT,V* be the rank-m k x n matrix whose columns are the 
vectors (pi, and let be the kxn frame matrix whose columns are the vectors (^". Then the unique 
optimal Fu is given by 

r 

Fu = aJ2^iV*i = aUZrV* = a$(($*$)i/2)t = a{{m*)^/^)^^, 

i=l 

where Ui and Vi denote the columns of U and V respectively, Zr is defined in a = ^ X^i=i 
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and {ai, 1 < i < r} are the nonzero singular values o/$. 
The residual squared error is given by 

r 

i=l 

In addition, 

1. If r = n, 

(a) Fu = a$(#*#)-i/2 . 

(b) F*Fu = aim o-'iT'd the corresponding frame vectors are orthogonal with norm a. 

2. If r < n, then {l/p)Fu Tfiay be TGdlizcd by the optiuidl oTthogoTidl fTduic TJidtvix Fu — 
Hp) Eti u,v* = {a/p)UZrV*. 

6 Connection with the Polar Decomposition 

We now show that the ULSF and the CLSF are related to the polar decomposition of the matrix 
Let $ denote an arbitrary kxn matrix, where k > n. Then $ has a polar decomposition [40, 



$ = HY, (91) 

where H is a. k x n partial isometry that satisfies II*II = In, and Y = ((I)*<I))^/2. The Hermitian 
factor Y is always unique; the partial isometry H is unique if and only if <I> has full column rank. 
If <I) = is the SVD of then a natural choice for H is 

H = UZnV*, (92) 
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where is given by ([Tq). If r = n, then this choice of H is unique. Otherwise H is not unique; 
however, its projection onto the column space lA oi ^ \& unique and is given by |E2| 



Eu = PuH = UZrV* = $(($*^>)^/2)t^ (93) 

where Zr is given by (||). 

Comparing ( p3D with (^) and (^), we conclude that the ULSF and CLSF are proportional to 
the (unique) projection onto U of the partial isometry H in a polar decomposition of Thus, the 
ULSF and CLSF can be computed very efficiently by use of the many known efficient algorithms 



for computing the polar decomposition (see e.g., p9^, H3, 4C, H4|). 



Recently the truncated polar decomposition (TPD), a variation on the polar decomposition, has 



been introduced [45| and has proved to be useful for various estimation and detection problems. As 
we now show, the columns of the TPD of a matrix <I> are just the closest normalized frame vectors 
to the columns (pi of 

Let ^ = UT,V* denote an arbitrary k x n matrix with rank r. Then the order-p TPD of ^ is 
the factorization 

Pu,^ = [UZpV*][V^*ZpV*] = HY, (94) 

where Pu^ is the orthogonal projection onto the space spanned by the first p singular vectors Ui 
of From ( |94[ ) it follows that the left-hand matrix in the order-r TPD of ^ is just the optimal 
normalized frame matrix Fc. Similarly, the left-hand matrix in the order-p TPD of with p < r, 
is the optimal normalized tight frame matrix corresponding to the vectors Pup4'i- 

Since the CLSF and ULSF are related to the polar decomposition of properties of these 
optimal frames can be deduced from properties of the polar decomposition (see e.g., 41, 43, 4^). 



For example, the CLSF or ULSF corresponding to two vector sets {(pi} and {V'i} are the same if 



and only if the corresponding frame matrices satisfy ^'^* = I'l'll^'l, where \X\ = X*X |46| 
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7 Comparison with Other Proposed Frame Constructions 

We now compare our results with previously proposed frame constructions. 

The most popular frame construction from a given set of vectors is the canonical frame. Given 
a set of vectors < i < n} the canonical frame associated with these vectors is the frame 



corresponding to the frame matrix 17, 22, 23] 



The canonical frame has many desirable properties. Its construction is relatively simple; it can be 
determined directly from the given vectors; and if the vectors (pi are linearly independent, then it 
produces an orthonormal basis for U 17 1. This construction was first proposed in the context 



of wavelets in |24], and plays an important role in wavelet theory |25, 27 1. However, no general 
optimality properties are known for the canonical frame. 

Comparing ( |95|) with (^), we see immediately that the canonical frame vectors are just the 
normalized tight frame vectors that are closest in a least-squares sense to the vectors {</>«}. Fur- 
thermore, the /?o-scaled tight frame vectors that are closest to the vectors {(pi} are the canonical 
frame vectors scaled by (Sq. 

From Theorem |8|, it follows that the canonical frame vectors are the tight frame vectors that 
minimize the least-squares error only if a = 1, i.e., only if Yll=i ~ Otherwise, the canonical 
frame is no longer the optimal tight frame in a least-squares sense. However, if we simply scale 
each of the canonical frame vectors by q, then the resulting frame minimizes the least-squares error 
among all possible tight frames. 
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We summarize our results regarding canonical frames in the following theorem: 

Theorem 9 (Canonical frames). Let {(pi} be a set of n vectors in a k- dimensional complex 
Hubert space TC that span an r- dimensional subspace U TC. Let <I> = UT,V* be the rank-r k x n 
matrix whose columns are the vectors (pi. Let Ui and Vi denote the columns of the unitary matrices 
U and V respectively, let < i < r} denote the nonzero singular values o/ <I>, and let be 

defined as in Let be the n canonical frame vectors associated with the vectors (pi, and let 
F denote the matrix of columns ipi. Then 

F = UZrV* = $(($*$)t)V2 = (($$*)l/2)t$. 

Ln addition, 

1. If r = n, 

(a) the canonical frame vectors form an orthonormal basis for U; 

(b) the canonical frame vectors are the closest orthonormal vectors to the vectors {(pi}, in a 
least-squares sense; 

(c) ifY^l=iCTi = r, then the canonical frame vectors are the closest orthogonal vectors with 
equal norm to the vectors {(pi}, in a least-squares sense; 

(d) define the scaled canonical frame vectors Lp[ = j3(pi. Then 

i. the scaled canonical frame vectors are the closest orthogonal vectors with norm (3 to 

the vectors {(pi}, in a least-squares sense; 
a. if (3 = (!/''') X^[=i fj, then the scaled canonical frame vectors are the closest orthog- 
onal vectors with equal norm to the vectors {(pi}, in a least-squares sense. 

2. Ifr< n, 

(a) the canonical frame vectors form a tight frame for lA; 
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(b) the canonical frame vectors are the closest normalized tight frame vectors to the vectors 

in a least-squares sense; 

(c) ifYll=i'^i ~ ^' then the canonical frame vectors are the closest tight frame vectors to 
the vectors {</>i}, in a least-squares sense; 

(d) Define the scaled canonical frame vectors ip\ = Then 

i. the scaled canonical frame vectors are the closest ji-scaled tight frame vectors to the 

vectors in a least-squares sense, 

a. if 13 = (I/'") X^i=i Cj, then the scaled canonical frame vectors are the closest tight 
frame vectors to the vectors in a least-squares sense. 



8 Optimal Frames For Geometrically Uniform Vector Sets 

An important issue in constructing frames from a given set of vectors, is to what extent the frames 
inherit the properties of the original vector set. In this section we consider the case in which 



the given vectors have a strong symmetry property, called geometric uniformity |29|. Under these 
conditions we can show that the optimal frame has the same symmetries as the original vector set. 

For simplicity, we consider only the optimal normalized tight frame, which from Theorem ^ 
coincides with the canonical frame. Since the canonical frame vectors are proportional to the 
vectors constituting the CLSF and the ULSF, the results extend in a straightforward manner to 
these more general constructions. 

A set of vectors S = 1 < i < n} is geometrically uniform (GU) if every vector in the set 
has the form (j)i = Ui(f), where (j) is an arbitrary vector and the matrices {Ui, 1 < z < n} are unitary 
and form an abelian group^ Q. 

If the vectors 0j are GU, then every row (or column) of the Gram matrix S = {{4'i,4'j)} is a 



^That is, Q contains the identity matrix J; if Q contains Ui, then it also contains its inverse ?7- ^; the product 
UiUj of any two elements of Q is in 5; and UiUj = UjUi for any two elements in Q |p7| . 
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permutation of the first row (or column) [^8| ; such a matrix will be called a permuted matrix. A 
set of vectors satisfying = cpi) for all i,j (as is the case e.g., for real vector sets) is GU 



if and only if the corresponding Gram matrix is a permuted matrix [42|. 

The canonical frame vectors corresponding to a GU vector set are conveniently characterized 
in terms of a Fourier matrix defined on an additive group G isomorphic^ to Q. Specifically, it is 
well known (see e.g., [47|) that every finite abelian group Q is isomorphic to a direct product G of 
a finite number of cyclic groups: Q = G = x • • • x Z^^, where is the cyclic additive group of 
integers modulo nt, and n = Yl^ Uf. Thus every element Ui ^ Q can be associated with an element 
g G G of the form g = {gi,g2, . . . ,gp), where gt G Z„j; this correspondence is denoted by Ui <-> g. 
Each vector (pi = Ui(p is then denoted as (j){g), where Ui ^ g. 

The Fourier transform (FT) of a complex-valued function (/? : G — > C defined on G = x 
• • • X Z„ is the complex- valued function ip : G ^ C defined by 



m = ^J2^h,g)ip{g), (96) 

where the Fourier kernel ih,g) is 

V 

{h,g) =Yle-^'''^'3t/m_ (97) 
t=i 

Here ht and g^ are the kth components of h and g respectively, and the product htgt is taken as an 
ordinary integer modulo n^. 

The FT matrix over G is defined as the n x n matrix = {-^{h,g), h,g ^ G}. The FT of a 
column vector (p = {^{g),g G G} is then the column vector = {(p{h),h G G} given by = Tip. 



* Two groups Q and Q' are isomorphic, denoted hy Q = Q' , \i there is a bijection (one-to-one and onto map) 
tp : Q ^ Q' which satisfies tp{xy) — (p{x)ip{y) for all x,y £ Q prf. 
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Since T is unitary, we obtain the inverse FT formula 



^ = r*^={ y(h,g)*m,9 ^G}. (98) 



Following the development in |28], we can now obtain the following result: 



Theorem 10 (Least-squares normalized tight frames for GU vector sets). LetS = {(pi = 
Ui(j),Ui G Q}, be a geometrically uniform vector set generated by a finite abelian group Q of unitary 
matrices, where (j) is an arbitrary vector, and let <I> be the matrix of columns (pi . Let G be an additive 
abelian group isomorphic to Q, let {(j){g),g G G} be the elements of S under this isomorphism, and 
let T be the Fourier transform matrix over G. Then the normalized tight frame that is closest in 
the least-squares sense to ^ is given by the frame matrix 

F = = u{h)J^*{h), 

heG 

where 

1. is the diagonal matrix whose diagonal elements are a{h)^^ when o"(/i) ^ andO otherwise, 

2. {cr(/i) = n^/'^y^s(/i), h G G} are the singular values of^, 

3. {s{h),h £ G} is the Fourier transform of the inner-product sequence , (j){g)) , g G G}; 
4- u{h) = (j){h)/a{h) when cr(h) ^ and otherwise, 

5. {(j){h),h E G} is the Fourier transform of {(j){g),g S G}; 

6. J-*{h) is the hth row of J-* . 

Finally, the frame matrix F has the same symmetries as ^. 
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8.1 Example of a GU vector set 



We now consider an example demonstrating the ideas of the previous section. (The same example 
was given in [^8|.) Further examples and applications of GU vector sets can be found in [ p^ . 
Consider the group Q n = 4 unitary matrices Ui , where 



Ui = h, U2 



-1 

1 










-1 









-1 



-1 









-1 

1 





= U2U3 



Let the GU vector set he S = {(/^j = Ui(/), 1 < i < 4}, where = ^[1 1 1 1]*. Then ^> is 



(99) 



(100) 



and the Gram matrix S is given by 



S 



2 
-1 
-1 





-1 -1 

2 

2 

-1 -1 



(101) 



Note that the sum of the vectors (pi is 0, so the vector set is linearly dependent. 

In this case Q is isomorphic to G = Z2 x Z2, i.e., G = {(0, 0), (0, 1), (1, 0), (1, 1)}. The multipli- 
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cation table of the group Q is 





Ui 


U2 


Us 


Ua 




Ui 


U2 


Us 


Ua 


U2 


U2 


Ui 


Ua 


Us 


Us 


Us 


Ui 


Ui 


U2 




C/4 


Us 


U2 


Ui. 



If we define the correspondence 

C/i^(0,0), f/2^(0,l), Us ^{1,0), C/4 ^(1,1), 



(102) 



(103) 



then this table becomes the addition table of G 



'^2 X ^2- 





(0,0) 


(0,1) 


(1,0) 


(1,1) 


(0,0) 


(0,0) 


(0,1) 


(1,0) 


(1,1) 


(0,1) 


(0,1) 


(0,0) 


(1,1) 


(1,0) 


(1,0) 


(1,0) 


(1,1) 


(0,0) 


(0,1) 


(1,1) 


(1,1) 


(1,0) 


(0,1) 


(0,0). 



(104) 



Only the way in which the elements are labeled distinguishes the table of (104) from the table of 
( |102| ); thus G is isomorphic to G. Comparing ( |102| ) and ( |104| ) with (101), we see that the tables 
and the matrix S have the same symmetries. 

Over G = Z2 X Z2, the Fourier matrix is the Hadamard matrix 



1 1 

-1 1 

1 -1 

-1 -1 



(105) 
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Using the equations of the theorem, we may find the canonical frame: 



1 



-1 



-1 



1 



1 



^/2 V2 -V2 -V2 



F = 



2V2 



(106) 



V2 -V2 V2 -V2 



1-1-1 1 

We verify that the columns (pi of F may be expressed as ipi = Uiipi, 1 < i < 4, where ifi = 
-^^[i- \/2 \/2 1]*. Thus the frame vectors ipi also form a GU set generated by Q. 
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